4 research outputs found

    Fast Search-By-Classification for Large-Scale Databases Using Index-Aware Decision Trees and Random Forests

    Full text link
    The vast amounts of data collected in various domains pose great challenges to modern data exploration and analysis. To find "interesting" objects in large databases, users typically define a query using positive and negative example objects and train a classification model to identify the objects of interest in the entire data catalog. However, this approach requires a scan of all the data to apply the classification model to each instance in the data catalog, making this method prohibitively expensive to be employed in large-scale databases serving many users and queries interactively. In this work, we propose a novel framework for such search-by-classification scenarios that allows users to interactively search for target objects by specifying queries through a small set of positive and negative examples. Unlike previous approaches, our framework can rapidly answer such queries at low cost without scanning the entire database. Our framework is based on an index-aware construction scheme for decision trees and random forests that transforms the inference phase of these classification models into a set of range queries, which in turn can be efficiently executed by leveraging multidimensional indexing structures. Our experiments show that queries over large data catalogs with hundreds of millions of objects can be processed in a few seconds using a single server, compared to hours needed by classical scanning-based approaches

    End-to-End Neural Network Training for Hyperbox-Based Classification

    Full text link
    Hyperbox-based classification has been seen as a promising technique in which decisions on the data are represented as a series of orthogonal, multidimensional boxes (i.e., hyperboxes) that are often interpretable and human-readable. However, existing methods are no longer capable of efficiently handling the increasing volume of data many application domains face nowadays. We address this gap by proposing a novel, fully differentiable framework for hyperbox-based classification via neural networks. In contrast to previous work, our hyperbox models can be efficiently trained in an end-to-end fashion, which leads to significantly reduced training times and superior classification results.Comment: 6 pages, accepted for poster presentation at ESANN 202

    Artificial Intelligence in Education

    No full text
    This book contains a collection of 19 systematic literature reviews conducted by Cognitive Science students in the "Artificial Intelligence in Education" seminar in the winter term of 2021/2022. From a Cognitive Science and Artificial Intelligence (AI) perspective, the book investigates the state of the art of research on applying AI technology in educational settings as well as the strengths, weaknesses, opportunities, and threats of these applications. The four larger areas covered by the studies are "Didactics and Ethics", "Methods and Technologies", "Extended Reality and Robots", and "Addressing Special Needs"

    Artificial Intelligence in Public Discourse

    No full text
    This book contains 26 studies conducted by students in the Cognitive Science seminar "Artificial Intelligence in Public Discourse". In their studies, they explore the use of the term Artificial Intelligence (AI) and related subfields in various parts of public discourse such as Twitter, user comments on news sites, expert interviews, government documents, television shows, newspapers, etc. It is investigated which strengths, weaknesses, opportunities, and threats are ascribed to AI technology and how this relates to the technical and academic state of the art and discussion. Most studies employ qualitative methods, but quantitative and mixed-methods approaches are also used
    corecore